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Abstract 

The possibility of coexistence of two competing populations is a 
classical question which dates back to the earliest 'predator-prey' mod- 
els. In this paper we study this question in the context of a model 
for the spread of a virus infection in a population of healthy cells, 
introduced in [3]. The infected cells may be seen as a population of 
'predators' and the healthy cells as a population of 'prey'. We show 
that, depending on the parameters defining the model, there may or 
may not be coexistence of the two populations, and we give precise 
criteria for this. 

1 Introduction 

We start by giving an informal description of the model studied in this paper. 
It is a two-dimensional Markov process (X(t),Y(t)) t >o, where X(t) is the 
number of 'healthy cells' at time t, and Y(t) is the number of 'infected cells' 
(i.e. cells having virus in them). Both components (X(t))t>o and (Y(t)) t >o 
behave in many ways like branching processes, although there are dependen- 
cies between them. A healthy cell is replaced by a random number of new 
healthy cells at rate 1. This random number is independent of other events 
and drawn from a distribution (pk)k>o] thus the rate at which a healthy cell 
is replaced by k healthy cells is pu- Infected cells are also replaced by k new 
(infected) cells at rate pk if k > 1 while they are replaced by new cells (die) 
at the higher rate Po + X. Here A > is a parameter that reflects the negative 
impact of the virus on the host's lifelength. When an infected cell dies (i.e. 
is replaced by new cells), it converts a random number of healthy cells into 
infected cells. The biological motivation is that when infected cells die they 
burst (lyse) and release 'free virions' which enter a random number of healthy 
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cells, thus infecting them. The number of conversions is independent of all 
other events, and is drawn from a distribution (7fc)fc>o- Hence, the processes 
(X(t)) t >o an d (Y(t)) t >o interact in that (Y(t)) t >o 'feeds' upon (X(t)) t > . The 
model is defined in detail in Section 2. Also, we refer the interested reader to 
[3] for a biological motivation of the model. We will sometimes simply write 
X or Y as a shorthand for (X(t)) t >o and (Y(t)) t >o respectively. 

As described, the model is in essence a pair of interacting branching 
processes. Markov branching processes with interaction have been much 
studied, see for instance [12] and the references within. The main purpose 
of this paper is the study of coexistence of the two, competing, populations 
X and Y. Similar types of questions have been studied in many contexts. 
One recent example is the so-called two-type-Richardson model. This can 
be informally described as follows. Consider the graph Z d , and let the two 
infections (red and blue) start with only one individual each. A site is infected 
by the red (blue) process at a rate which equals the infection parameter 
A r (Aft) times the number of neighbours infected by the red (blue) process. 
Further, if a site gets infected by the red infection it stays red forever and 
similarly if it is infected by the blue infection. The main question is if they 
can coexist, i.e. if there will be two unbounded components of red and blue 
sites, see for instance [6, 4, 11]. 

In [3], much focus was on the study of the extinction probability r] of 
the infected process (Y(t)) t >o- There, rj was taken as an indicator of the 
'evolutionary fitness' of the virus. The main result was that for fixed (pk)k>o 
and (7fc)fc>o satisfying 7o = the extinction probability rj is maximized when 
A = 0. In fact, it was shown that r] is increasing in A. The main result of 
this paper concerns the coexistence probability (. 

Definition 1.1. We call 

C = P(X(t)Y(t) > 1 for all t > 0). 
the coexistence probability of X(t),Y(t). 

Of course, ( depends on the parameters used to define the process, but 
we suppress this dependence in the notation. Introducing the stopping time 
T u = inf{t > : X(t)Y(t) = 0}, we have that C = P(T U = oo). The relevance 
of coexistence in the study of rj will be discussed in Section 6. 

The proof of our main result uses two auxiliary branching processes X(t) 
and Y(t) defined and discussed in detail in Section 3.1. Informally, X(t) is 
a process distributed as X(t) without the influence of Y(t), i.e. Y(0) = 0. 
Furthermore, Y(t) is a process distributed as Y(t) with an infinite supply 
of healthy cells, i.e. A(0) = oo. Our main result is formulated in terms 



2 



of the so-called malthusian parameters for these processes, denoted by a 
and j3 for X(t) and Y(t) respectively (see Section 3.1). It turns out that 

a = Efclo k Pk - 1 and P = a + Po Efclo k ^k + MEfcLo fc 7fe - !)■ 

Theorem 1.2. For arbitrariy initial conditions X(0), Y(0) > 1 and offspring 
distributions (pk)k>o with finite second moment, the coexistence probability 
is positive if and only if a > (3 > 0. 

Remark 1.3. Note that coexistence is only possible if 7 > 0, because 
otherwise /3 > a. This follows from the expressions for a and /3 and Theorem 
1.2: if 70 = then Y cannot die out as long as X survives. Hence, there is 
then almost surely a time t > such that X(t) =0 and Y(t) ^ 0. (This 
result was announced in [3] as part of Proposition 3.2.) 

Theorem 1.2 establishes, under a second moment condition, for which 
values of a and j3 we can have coexistence. Our next result strengthens the 
second part of Theorem 1.2. Recall that T u = inf{t > : X(t)Y(t) = 0}. 

Theorem 1.4. For offspring distributions (pk)k>o with finite second moment, 
and for any choice of a < (5 we have that E[T U ] < oo. 

We have not been able to establish in general if T u has finite or infinite 
expectation when a — f3. (But see Remark 5.1 for a special case.) 

On the way to proving that coexistence is indeed possible (when a > (3) 
we use general facts about order statistics and trimmed sums, see Lemma 3.4. 
The second part of that lemma is an interesting application of Harris' inequal- 
ity [8] to bound the variance of a trimmed sum, which we have not found in 
the literature. 

In the two-type Richardson model mentioned above, coexistence is con- 
jectured to hold if and only if A r = The 'if condition has been established, 
see [6, 4, 11], while [7] makes progress on the 'only if condition. In fact, the 
model studied here is closer to the following variant of the two-type Richard- 
son model. If a site is infected by the blue process, it changes color if the 
red process attempts to infect it, while if a site is infected by the red process 
it stays so forever. That is, a red site is immune to the blue process while a 
blue site is not immune to the red infection. Analogy with the model studied 
in this paper suggests that there can then be coexistence if > A r , but not 
if Aft < A r . 

We end this section with an outline of the rest of the paper. In Section 2 
we give a precise definition of the model. In Section 3 we state and prove 
preliminary results needed in the proofs of our main results. In Sections 4 
and 5 we prove Theorems 1.2 and 1.4 respectively. Finally we discuss some 
applications of these results in Section 6. 
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2 Definition 



Let (pk)k>o and (7fe)fe>o be probability distributions on the nonnegative in- 
tegers, and let A > 0. We exclude the (degenerate) case when p 1 — 1; in fact 
the reader may for convenience assume that p\ = 0, since this only amounts 
to a time- change. 

The continuous-time Markov chain (X(t),Y(t)) t >o, taking values in Z+, 
was informally described in Section 1. To recapitulate the main points, each 
healthy cell is replaced by k > new healthy cells at rate pk- Being replaced 
by k = new cells corresponds to dying. Each infected cell is replaced by 
k > 1 new infected cells at rate p k . When an infected cell dies, which occurs 
at rate p + A, a random number of healthy cells are converted into infected 
cells. If t is the time of such an event, we draw a random variable T t from the 
distribution (7fc)fc>o independently of other events. If T t < X(t) we simply 
declare T t of previously healthy cells to be infected, while if T t > X(t) we 
declare all previously healthy cells to be infected. To define this process 
formally, we list the different possible jumps in Table 1, where we use the 
notation x A k = min{x, k}. 





Transition from (x, y) to 


Rate 


Valid for 


(i) 


(x-l + k,y) 


xp k 


k > 


(ii) 


(x, y — 1 + k) 


VPk 


k > 1 


(iii) 


(x - (x A k), y - 1 + (x A k)) 


y(po + A)7 fc 


k > 



Table 1: Transition rates for the process {X(t), Y(t)) t >o- Rates are given for 
transitions from a state (x, y) and are valid for all x, y > 0. 

Note that there may be several transitions in Table 1 leading to the same 
state. In such cases the correct interpretation is to add the corresponding 
rates. An example of this is the transition (0,y) — > (0,y — 1), which occurs 
at rate y(po + A), which is the sum over all k > in (iii). To avoid trivial 
cases, we assume throughout that X(0),Y(0) > 1. Biologically it might be 
most relevant to consider the case when p k = for k > 3, but all our results 
are valid in greater generality, so we make no such restriction. 

We now state some immediate properties of the model. If it were the case 
that Y (t) = 0, then healthy cells would evolve as a Markov branching process, 
with intensity 1 and offspring distribution (pk)k>o- Similarly, if X(t) =0 for 
some t, then (Y(t+s)) s >o would behave like a Markov branching process with 
the higher intensity (1 + A) and an offspring distribution derived from (pk)k>o 
by placing more mass on k = 0. When both X(t),Y(t) > 0, as transition 
rate (iii) tells us, healthy cells may turn into infected cells. This scenario 
hence 'helps' the process (F(t)) t > and 'hurts' the process (X(t)) t >o- 
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3 Preliminary results 



In this section we establish several lemmas which will be used in the proofs 
of Theorems 1.2 and 1.4. Although their motivation may not be obvious on 
a first reading, we find it convenient to collect all such preliminary results 
here so as not to interrupt the flow of the main proofs later. 

A note on notation: we will sometimes write a sum of the form Ylt=i x k 
where a is non-integer. The correct interpretation is that the sum goes to 
the integer part [a\ but we prefer to omit the |_-J to keep the notation more 
readable. A similar comment applies also in other places throughout the 
paper. 

3.1 Auxiliary random variables 

It will at several points be useful to compare X and Y to two 'larger' processes 
X and Y. Here X may be thought of as the healthy process in the absence 
of infection, and Y as the infected process in an infinite 'sea' of healthy cells. 

To be precise, we let X and Y be two branching processes with lifelength 
intensities 1 and 1 + A, and offspring distributions (p k )k>o and (q k ) k >o, re- 
spectively, where (q k ) k > is given by 

7o(po + A) p k + 7 fc (po + A) 
g = 1 + A , and q k = for k > 1. (1) 

In Table 2 we give a list of the rates used for the coupling of (X, Y) to (X, Y). 
However, before that, we give an intuitive explaination. 

We start with equal sizes, X(0) = X(0) and Y(0) = Y(0). Each indi- 
vidual in X(0) is paired with a unique 'friend' in X(0), and each individual 
in Y(0) is paired with a unique friend in Y(0). Whenever a cell in X either 
multiplies or dies a natural death (transition (i) in Table 1) then its friend 
in X undergoes the exact same transition, and the offspring are paired in 
the natural way. Similarly, whenever a cell in Y multiplies (transition (ii) in 
Table 1) then its friend in Y undergoes the exact same transition, and again 
the offspring are paired in the natural way. When a cell Y has a lysis (tran- 
sition (iii) in Table 1), sample a random variable T with distribution (7fc)fc>o- 
Infect TAX cells from X, but let the friends in X of the newly infected 
cells in X remain unchanged (but lose their friends, existing as singletons). 
Proceed by letting the friend in Y of the cell in Y which underwent lysis be 
replaced by T new cells. Finally, pair the newly infected cells, now belonging 
to Y, with the new cells of Y. Note that if T > X, the this will result in 
some of the cells in Y being unpaired. 
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Thus every element of X always has a friend in X, and every element of 
Y always has a friend in Y; but some cells in X and Y might be unpaired. 
We let unpaired cells give rise to independent Markov branching processes 
with the correct intensities and offspring distributions. The rates of the 
coupled process (X, X, Y, Y) are summarized in Table 2. As before, the 



Transition from (x, x, y, y) to state 


Rate 


Valid for 


(x — 1 + k,x — 1 + 


k,y,y) 




k > 


(x,x-l + k,y,y) 




(x - x)p k 


k>0 


(x,x,y - 1 + k,y - 


-1 + k) 


ypk 


k > 1 


(x,x,y,y- 1 + k) 




(y - y)Pk 


k > 1 


(x — (x A k), x, y — 


l + {x Ak),y -1 + k) 


y(Po + A)7fc 


k>0 


(x,x,y,y- 1 + k) 




(y-y)(po + A)7fc 


k>0 



Table 2: Transition rates in the coupled chain (X,X,Y,Y). Rates are given 
for transitions from a state (x, x, y, y) and are valid for all x, x,y,y > 0. Note 
that the ordering x < x, y < y is preserved. 



correct interpretation is to add the rates of transitions leading to the same 
state. We note that our coupling satisfies the following: 

1. X(t) < X(t) and Y(t) < Y(t) for all t > 0; 

2. if X(t) ^ then Y(t) = Y(t). 

For a probability vector n = {jk : k > 0) we write n for the mean 
J2k>o k n k- Let a and (3 be the Malthusian parameters of X and Y, respec- 
tively, given by 

a = p-l, /3 = (g-l)(l + A). 

It is well known [9] that X(t)/e at and Y(t)/e^ are martingales which con- 
verge almost surely to some nonnegative random variables. We have that 
P(A U B) = 1 where 

A = \X(t) = for some t > 0} and B = { liminf \og(X(t))/t > 0}. (2) 

Moreover, P(A) = 1 if and only if a < 0. On the event B, the limit 
liim._j.oo log exists and equals a. The corresponding statements hold 
for Y(t) with a replaced by (5. Note for future reference that 

,_ (? _ 1)(1 + A) _ ( _±i(a±__ l)(1 + A) (3> 

= p — 1 + po7 + A(7 -!) = « + po7 + A(7 - 1). 
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Next, let U, V, W and $ denote random variables with the following 
distributions. Firstly, U and V have the distributions of (the sizes of) X(l) 
and Y(l), respectively, when X(0) = 1 and Y(0) = 1. Secondly, W has 
the distribution of X(l), where X is a branching process, started at 1, with 
lifelength intensity 1 and offspring distribution it given by 7r = 0, 7Ti = 
Po + pi, and 7Tfc = pk for k > 2. Thus X is essentially X with deaths 
suppressed. Finally, to define <3> run a sample of Y for time 1, started with 
Y(0) = 1; for each branching event that occurs during this time sample an 
independent Bernoulli random variable with success probability po, and let 
L denote the total number of successes. Let $ have the distribution of a sum 
of L independent copies of T. Thus $ is, intuitively, the number of infection 
attempts during a unit time interval starting with one infected cell. 

Lemma 3.1. Let r > 1 and let D denote a random variable with distribu- 
tion {p k )k>o- Then 

1. E{U r ) < oo if E(D r ) < oo, 

2. £(V r ) < oo if E{D r ) < oo and E(T r ) < oo, 

3. £(W r ) < oo if E(D r ) < oo, 

4. < oo if £(r) < oo and £(£>) < oo. 

Proof. From [2, Corollary III.6.1], we know that a branching process with 
offspring distribution it has finite rth moment at time t > if tt has its rth 
moment. This immediately gives parts 1 and 3. Part 2 follows from (1), 
which implies that (qk)k>o has its rth moment if (pk)k>o and (7fe)fc>o do. For 
the final part, note that 

E($) = E^TjliL > j}) = E(T)E(L). 

An easy (stochastic) upper bound on L is given by Y(l) where Y is a branch- 
ing process with intensity 1 + A and offspring distribution (qk)k>o, where 
Qo — Qi — 0, ?2 = 12 + Qi + qo, and = q^ for k > 3. Thus i?(L) is finite if 
E(T) and E(D) are finite, as in part 2. □ 

We will in what follows always assume that (pk)k>o has finite second 
moment, since this is part of the assumptions in Theorems 1.2 and 1.4. By 
Lemma 3.1 this implies that E(U 2 ) < oo, E(V 2 ) < oo, E(W 2 ) < oo and 
E(Q) < oo. This will allow us to apply Chebyshev's bound, which we will 
use in the following form. Let Zj (j > 1) be independent, all with the same 
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nonnegative mean E(Z) > and finite variance Var(Z) < oo as some random 
variable Z. Let N > 1 be any integer and let 5 > 0. Then 



N N ,2 



p(5>,- > (i + <^([E z i- ^] > N 2 s 2 E(zy) 

i=i j"=i (4) 

iVVar(Z) 1 Var(Z) 



~ N 2 5 2 E(Z) 2 N 5 2 E(Z) 2 ' 

Similarly 

P (^,<(l-W))4-^|. (5) 



3.2 Estimates 

The following lemma says that Y cannot be much larger than X for very 
long without making X extinct. This lemma will be the main step in the 
proof of the case /3 > a in Theorem 1.2, which is the case when the process 
Y grows much faster than X. In the statement of the lemma, we let W be 
as in Lemma 3.1, and let £ be a Bernoulli variable with success probability 
1 — e _ ( 1_7o ^ 0+A ^ (this being the probability of a lysis leading to at least one 
new infection occuring in a time interval of length 1). We fix c > and let 
S(t) > be any function such that 

n5(n)>ilog( 2 ^) (6) 

for all sufficiently large n. We write 

A n = {Vt G [n,n+ 1],0 < X(t) < e {c - 5{t))t < e {c+5{t))t < Y(t)}. 

Lemma 3.2. There is a constant C > such that for n large enough that (6) 
holds, 

P(A n ) < Ce- (c - 5(n))n . (7) 

In particular, we can take C = 9(Va 1 r(W)/E(W) 2 + Var(£)/£(0 2 )- It follows 
that P(A n \.o.) = 0. 

Before turning to the proof we remark that we only actually use this 
lemma with 5 constant. We prove this slightly more general result since very 
little extra work is required, and we hope that it will be useful for future 
work. 
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Proof. The result is trivial if S(n) > c so we assume that S(n) < c; we also 
assume throughout the proof that n is large enough that (6) holds. Suppose 
that A n occurs. Let <£>„ denote the number of infection attempts during the 
time interval [n, n + 1] , that is to say the sum of an independent sample 
of T for each lysis of (Y(t) : t E [n,n + 1]). Let £( n ) be obtained from 
(Y(t) : t £ [n,n + 1]) as follows. Start by numbering the elements of Y(n) 
(arbitrarily); then observe those elements numbered at most e^ c+5 ^ n until 
they undergo a branching event; let £j be the indicator of the event that 
cell j has a branching event which results in a lysis for which the associated 
T-value is at least 1 = if there is no branching event before time n+ 1); 
finally let f W be the sum of the Then ^ has the following properties: 



2. is a sum of e(. c + 5 ( n )) n independent Bernoulli variables, each with 
success probability 1 — e~ p °( 1_7o ^ 1+A ), and 

3. £ (n) is independent of (X(t) :te[n,n+ 1]). 

Next, let denote the total number of healthy cells that ever exist in 

the time-interval [n,n + 1]. Of course, if W {n) < $„, then A n cannot occur 

since this would imply that X(n + 1) = 0. We cannot immediately conclude 

from the fact that X(t) < e^ 6 ^ for every t G [n,n + 1], that W {n) is 

bounded by e ( c -^( n + 1 ))( n + 1 ). However must be stochastically bounded 

by the sum of e^ c ~ s ^ n independent copies Wj of the random variable W in 

Lemma 3.1. (Recall that W is, intuitively, X(l) when deaths are suppressed.) 

Also, W {n) is independent of Thus, writing a n = e {c ~ 5{n))n and b n = 
e (c+s(n))n^ we have that 



Note that b n /a n = e 2nS ^ > e ^g(2E(w)/E(0) = 2E(W)/E(£), by (6). We get 



1. < $, 



P(A n ) < P(WW > e (n) ) < P(Y,W 3 > J^Cj) 



3=1 3=1 
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that 

9Vax(W) 9Var(£) 
" a n E(Wy + KEW 1 

where we use (4) and (5). This gives (7). That P(A n i.o.) = follows from 
the Borel-Cantelli lemma. □ 

Recall that if U(t) is a Markov branching process with Malthusian pa- 
rameter u then W(t) = U(t)/e ut is a martingale. We make no claim as to the 
originality of the following lemma, yet have not seen it explicitly formulated. 

Lemma 3.3. Let U(t) be a branching process whose offspring distribution 
has finite second moment and with Malthusian parameter u > 0. 

1. For any A > we have that 

P(3t > : W{t) > A) < A -1 . 

2. For each e > there is some k > such that 

P(3t > r : < W{t) < e~ et ) < e~ KT . 

Proof. The first part is simply a consequence of Doob's submartingale in- 
equality, which gives that for any T > 0, 

P(3t G [0,T] : W(t) >A) = P( sup W(t) > A) < E[W(T)]/A = A -1 . 

Letting T — > oo concludes the proof of this case. 

For the second part, we proceed by discretizing. Let /i = E[U(1)] = e u 
and let W n = U (n) / fi n for every n G N. It is no loss of generality to assume 
that e < u/2. The limit W := lim„ W n exists a.s. since (W n ) n >i is a 
nonnegative martingale. A straightforward and standard calculation (see for 
instance [9, p. 13]) shows that for any r > n, 

E[fi n (W r - W n f] = a 2 {^ 1 + fi- 2 + ■ ■ ■ + n~ r ), 
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where a 2 = Var(C/(l)). Therefore by Fatou's lemma 

E[(W - W n f] < liminf E[(W r - W n f] = -^-^ n (8) 

r-*-oo /i — 1 

for all n. Hence by Markov's inequality 

P(\ W - W n \ > e— ) < E[{W ~ ^" )2] < -^e-( M - 2£ >. 
Vl 1 ' ~ e- 2en ~ p - 1 

It is well known (see for instance [9, Theorem 8.3]) that there exists a con- 
stant c 3 > such that for any interval I C (0, oo) we have P(W e I) < c 3 \I\. 
Furthermore, it is also well known [9, Theorem 8.4] that there exists a con- 
stant c 4 > such that P(W — 0, W n ^ 0) < e~ C4n . Therefore (adjusting c 3 
as necessary) 

P(0 < W 7 "™ < e~ e ™) < P(W = 0, W n > 0) + P(0 < W < 2e~ £n ) 

+ P(\W-W n \ > e- £n ) (9) 
< c 3 (e- C4n + e- £n + e- (u - 2£)n ). 

Clearly 

P(3s > t : < W(s) < e- £S ) < P(3n > t : < W n < e^ 2 ) 

+ P(3s > t : < W(s) < e~ es , Vn > t W n = or W n > e- £n/2 ). (10) 

We have bounded the first probability on the right hand side in (9). The 
second probability is bounded above by 

P(|j{3s G [n,n+ 1] : W{s) < e~ £n , W n > e~ £n/2 }) 

n>t 

< J]P(3s e [n,n + 1] : W(s) < e~ £n \ W n > e- £n/2 )P(W n > e~ £n/2 ) 

n>t 

< J]P(3s e [n,n + 1] : C/(s) < e u(ri+1) " eri | C/(n) > e un - £ "/ 2 ). 

It therefore suffices to show that each of the summands is exponentially small 
in n for large enough n. 

To establish this we take the following point of view. Let M = U(n) 
and label the particles present at time n by 1, 2, ... , M. If particle j has a 
branching event with zero offspring we say that particle j is destroyed. If it 
has a branching event with one or more offspring, we consider particle j to be 
still present, essentially identifying it with one of its offspring particles. With 
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this convention, we let Aj denote the event that particle j is ever destroyed 
during the time interval [n,n + 1]. Thus P{Aj) < 1 for all j, and the events 
Aj are independent. If U(s) < e u ( n + 1 )- £n for some s E [n, n + 1] then at least 
M — e u(n+i)-en Q f foe even t s 4 . must occur. But since M > e un - £n / 2 



M M 



3=1 3=1 

M 



P (J2 1a >- M ~ e M(n+1) - £n ) <p(^2l Aj > M(l - e u e~ l 

3=1 
M 

<p(^l, j >MP(A J )(l + 5)) 



3=1 

for large enough n and some 5 > 0. The latter probability is by (4) at most 

C/M < Ce- (u - £/2)n . 
This gives the result. □ 



3.3 A lemma about order statistics 



The following result will be used in the case a > (3 in Theorem 1.2, but may 
also be of independent interest. The first part essentially goes back to [1] (in 
the case p — 2), but we have not found the second part in the literature. 

If (^)i<j<M is a sequence of indentically distributed random variables, 
we let X(x) < X( 2 ) < • • • < denote the order statistics of {Xj)i<j< M . 

Lemma 3.4. Let (X/)i<?<m ^ e as aDove - 

1. If p > 1 and \\XiWp = E[X{] l / p < oo then for each subset A C 
{1,...,M}, 

(11) 



j&A 



where m = \A\ and 1/p + 1/q = 1. 
2. If the Xi are independent and < oo, then 



M-rn M 

Var( X U)) < Var(^X,) = M ■ Var(X 1 ). (12) 



3=1 3=1 

Proof. The first part is a consequence of Holder's inequality: 

i/p 



E 



M 



X, 



e[Y^x (j) i{j e A] 

3=1 

1/p 



M l j M 



i/q 



M 



E 



3=1 



X, 



\A\ 



1/9 



3=1 



3=1 
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For the second part, let X denote the sequence (X±, . . . , X M ) and let 

M-m M 

f(X) = £ X U) and g(X) = £ X (j) . 

j=l j=M-m+l 

Note that both / and g are increasing functions in the sense that if x = 
(xi, . . . , x n ) and y — (y±, . . . , y n ) satisfy X{ < yi for every % — 1, . . . , n, then 
f(x) < f(y) a ndg(x) < g(y). Thus also f(X)-E[f(X)\ and g(X) -E[g{X)\ 
are increasing functions. It follows from Harris' inequality that 

E[(f(X) - E[f(X)])(g(X) - E[g(X)])} 

> E[f(X) - E[f(X)]]E[g(X) - E[g(X)}] = 0, 

that is to say 

M-m M 

Cov( X U))>°- 

j=l j=M—m+l 

It follows that 

M M-m M 

Var(^X,)=Var(^X ( , )+ £ X {j) ) 

j=l j=l j=M-m+l 

M-m M 

= Var(^X ( , ) )+Var( £ X {j) ) 

j=l j=M-m+l 

M-m M 

+ 2Cov(^X ( , )! X U)) 

j=l j=M-m+l 
M-m 

> Var (£*(,->) ■ 

3=1 

□ 

Setting m = 1 in (11) we deduce that E[X^m)\ is of order at most M 1 ^ 
when the Xi have finite p:th moment. Results of this type, usually formulated 
for p = 2, go back to [1, 5, 10] 

Note that (11) is in some sense sharpest when A = {M — m + 1, . . . , M} 
because then the sum consists of the m largest terms; this is the case we will 
be using. 
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4 Proof of Theorem 1.2 



Clearly (by (2)) ( = if either a < or (3 < 0, so we assume henceforth that 
a, j3 > 0. The proof of Theorem 1.2 will be divided into the three cases (i) 
a < (3, (ii) a = f3 and (hi) a > (3. 

The case a < (3. The intuition is that if coexistence were to take place, then 
Y{t) would eventually be much larger than X(t); but then there is a good 
chance that all healthy cells are infected in, say, time 1, which would con- 
tradict coexistence. To make this intuition exact, let c = (a + (3)/2, 5 = 
(f3 — a)/4 > 0, and use (2) to see that 

P(T U = oo) = P(T U = oo, 3t : X(t) < e (c ~ 5)t < e {c+5)t < Y(t), Vt > t Q ) 
< P{3t : < X(t) < e {c ~ s)t < e ic+5)t < Y{t), Vt > t ), 

(13) 

since on {T u = oo} we have that < X(t) < X(t) and Y(t) = Y(t) for every 
t > 0. Trivially, the right hand side is bounded above by P(A n i.o.) where 

A n ■= {Vt e [n, n + 1], < X(t) < e^ 1 < e^ 1 < Y(t)}. 

But P(A n i.o.) = by Lemma 3.2. □ 

The case a = (3. For the case a = (3 > the intution is that there will 
typically be so many infection events that X effectively (i.e. counting losses 
due to infections) has a strictly larger rate of deaths than X, allowing us to 
essentially reduce this case to the case a < f3. Note that the process 

Rm Y(t) Y(t) e at 



X(t) e at X{t) 

converges almost surely to some random variable R, since Y(t)/e at and 
X(t)/e at are nonnegative martingales. The limit R may be infinite, but 
on the event {T u = oo} we have that < R < oo. Furthermore, since 
is an absorbing state for the process R(t) we have (up to a null event) that 
{T u = oo} C {inf t > -R(t) > 0}. It follows that for each r > we have 

{T u = oo} C {0 < inf R(t) < r} U G r , 

where 

G r = jifij > r Vt > o} n {X(t)Y(t) > Vt > 0}. 
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For each 5 > we may choose r > sufficiently small so that P(0 < 
inf 4 > R(t) <r)<5 and thus C < 5+P(G r ). We aim to show that P(G r ) = 
for each r > 0; since 5 > was arbitrary this will complete the proof. 

Fix 5, r > as above. We will couple X, X and Y to a new process 
X' which is obtained by taking into account some of the effect of Y on 
X. The process X'(t) will be a Markov branching process and will satisfy 
X'(t) < X{t) for all t > 0. Moreover, on the event G r we will have that 
X(t) < X'(t) for all t > 0. We let X'(0) = X(0). The rates governing the 
quadruple (X,X',X,Y) are given in Table 3, where we have written 




Transition to state 


Rate 


For 


(x + k — l,x' + k — 


l,x + k - l,y) 


(x A x')p k 


k > 


(x + k — l,x',x + k 


-hy) 


(x — x A 


fc > 


(x, x' + k — 1, x + k 


-hy) 


(a;' — rr A 


fc > 


(x, x', x + k — 1, y) 




(x — x V 


fc > 


(x, x', x, y + k — 1) 






k > 1 


(x,x',x,y- 1) 








(x — (x A k), x', x, y 


- 1 + (x A k)) 


S/(Po + A)7fc(l - «) 


k > 1 


(x — (x A k), x' — 1, 


x,y - 1 + (x A k)) 


y(Po + A)7fc« 


fc > 1 


(x, x' — 1, x, y) 




(ra/ - kj/)(po + A)(l - 70 ) 





Table 3: Transition rates in the coupled chain (X, X', X, Y). Rates are given 
for transitions from a state (x, x', x, y) and are valid for all x, x', x, y > 0. 



We note from Table 3 that the triple (X,X,Y) has the correct marginal 
distribution, i.e. as described in Section 3.1. For example, summing the first 
two lines the of the table gives the rate (x Ax' + x — x A x')pk = xpk for the 
transition x — > x + k — 1. Similarly, x — > x + k — lat rate given by the sum of 
the first four lines, and using that x + x' — x A x' = x \/ x' we get the correct 
rate xpk- 

Consider now the marginal distribution for X' . First note that, since 
k < rx'/y, the final rate in Table 3 is nonnegative. Adding the rates for the 
transitions x' — >■ x' — 1, we find that this transition occurs at rate 

x'{j> Q + rip Q + X){l - 7o )). 

Together with the rates for x' — > x' + k — 1 for k > 1, this means that X'(t) 
is a Markov branching process with lifelength intensity 

l + r(p + A)(l-7o) 
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and offspring distribution p' given by 

, = Po + ^(po + A)(1-7q) 

Po ~ l + r ( Po + A)(l-7o)' 
/ = Pk , 1 

Pfc l + r ( Po + A)(l-7o)' " ' 
In particular, the Malthusian parameter of X' is 

a' = (l + r( Po + X)(l- lo ))(p'-l) 

= a-r(po + A)(l-7o) 
< a for r > 0. 

Clearly < for all t > 0. On the event G> we also have that 

y(t) = y(t) for all t > and that Y(t)/X(t) > r for alU > 0. It follows 
that, on G>, we have that 

Y(t) Y(t) Y(t) r „ 
r < -A-L = -M- < — Vt for all t > 0, 
" X(t) X(t) ~ X'(t) 

so that rX'(t)/Y(t) < 1 and hence k(X' (t) ,Y (t)) = rX'(t)/Y(t). Thus the 
final rate in Table 3 is always on the event G r , and hence so is the second 
rate. Therefore, we get that G r C {X(t) < X'(t)Vt > 0}. 

Let c = (a + a')/2 and 5 = (a — a')/ A > 0. Using (2) we therefore deduce 
that 

P(G r ) < P(3t : < X(t) < X'{t) < e^ 1 < e^ 1 < Y(t), Vt > t ). 

(14) 

By Lemma 3.2, the probability on the right equals zero. Since 5 > was 
arbitrary it follows that £ = 0. □ 

We are now ready to prove the final case of Theorem 1.2. 

The case a > f3. The intuition here is that X(t) 'wants' to be of the order e at 
and Y(t) 'wants' to be of the, much smaller, order e 131 . Typically, therefore, 
the infection will have very little impact on the healthy population. 
To make this intuition rigorous, let 

n 2 ™ 1 n 1 

^rK 1 -^)- b "=^-wh c »=n( i +F)' 

k — 2 k — 2 k — 2 

Note that a n and b n form decreasing sequences with limits in (0, 1) and that 
c n is an increasing sequence with limit in (1, oo). Write B n for the event that 

X{n) > a n e an and b n e Pn <Y(n) < c n e Pn . 
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We will prove that there is some N such that 

P(B n+1 \B n )>l-\ (15) 

for all n > N. This will, using the Markov property, establish the result, 
since P(B N ) > and 

C > P(n n > N B n ) = P{B N ) J] P{B n+1 | B n ) > 0. 

n>N 

We start by observing that (again using that Y(t) = Y(t) whenever 
X(t) > 0) 

P(B n+1 | B n ) (16) 
= P(X(n + 1) > a n+1 e^ n+1 \b n+1 e^ n+1 ^ < Y{n + 1) < c n+1 e^ n+1 ^ \ B n ) 
> 1 - P(X(n + 1) < a n+l e a ^ \ B n ) - P(Y(n + 1) < & n+1 e# n+1 > | B n ) 
-P(Y(n + l)>c n+1 e^ n+1 ^ \B n ). 

We will proceed to show that all three probabilities on the right hand 
side are small. To prove that P(X(n + l) < a„ +1 e Q ( n+1 ) | B n ) is small, let $ n 
denote the number of infection attempts during the time interval [n, n + 1] , 
as in the proof of Lemma 3.2. We will first show that $ n will typically be 
much smaller than X(n), and will deduce from this the required lower bound 
on X(n + 1). For the bound on $„, we use Markov's inequality to see that 

p(*. > i s») < c J ( *;l^ EW * (^rp. w 

where $ is the random variable of Lemma 3.1 and we used the fact that, 
given B n , the number $ n of infection attempts is dominated by the sum of 
c n e^ n independent copies of $. 

Let M = M(n) = a n e an and m = m(n) = c n e /3n (n + 1) 2 E($) (so X(n) > 
M on B n , and m is the quantity in (17)). Let {Uj)i<j<M denote independent 
copies of the random variable U of Lemma 3.1. The lower bound on X(n + 
1) will be obtained by noting that the impact of infection during the time 
interval [n, n + 1] can be no larger than the effect of removing, at time n, 
those <!>„ healthy cells that would otherwise give rise to the largest ancestry 
at time n + 1. In particular, 

X(n)-$„ 

X(n + 1)> U U), ( 18 ) 
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where U(\) < Up) < ••• < U(m) denote the order statistics of U\,...,Um 
as in Section 3.3. For n large enough we have M > m, and on the event 
B n fl {<£>„ < m} we have 



X (n+1)> U U> 
j=i 



(19) 



Recall that E(Uj) = e a . From the first part of Lemma 3.4, we have that 

M 

E [ E M =0 (^) = ( M ^^) 



j=M-m+l 



Observe that a n+ ie a( - n+lS> = (l — j^-^)M ■ e a and that for large enough 



we have that 

M-m 



n 



M-m 



3=1 



p{ E u o) < - ^tif) m • e ") = P ( E U V) - E [H u ] < 

M—m M—m 

-p{J2 u u)- E [J2 u u) 

3=1 3=1 
M—m M—m 

^(E^-^E^-) 



2Me a 
"(n+1) 2 



3=1 3=1 

M 

+e[ E %]) 



(n+1) 2 



j=M-m+l 



3=1 



Me a 
' (n+1) 



0- 



(20) 



By Chebyshev's bound (4) and the first part of Lemma 3.4, 

M—m M-m (n + l) 4 Vs* ( ^fT™ U {j) ) 

K E ^ - * E < <J5S0 < }«ul j 

3=1 3=1 

(n + l) 4 Var(^i) _ 

Taking into account also (17) it follows that 

P{X(n + l)>a n+1 e a ^ \ B n ) 

> P(X(n + 1) > a n+1 e a(ri+1) | $ n < m, B n )P($ n <m\B n ) 



M-m 



> P 



( J2 U U) > a n+1 e^) (l - 

3=1 



1 



> (l-0(e" 



1 - 



n+1) 2 
18 



> 1 - 



(n + 1) 2 
2 



n + 1) 2 ' 



for n large enough. 

We proceed with the second and third probabilities on the right hand 
side of (16). We have, with Vj independent and having the distribution of 
V in Lemma 3.1, using that E(V) = eP, (5) and that Y(t) = Y(t) whenever 
X(t) > 0, 

P{Y{n + l)<b n+l e^ n+ ^\B n ) 

= P(Y(n + 1) < b n+1 e^ n+ V | X(n) > a n e an ,b n e? n < Y{n) < c n e Pn ) 
= P(Y{n + 1) < 6„ +1 e /3(n+1) | b n e^ n < Y{n) < C y n ) 
< P(Y(n + 1) < b n+1 e^ n+ V | Y(n) = b n e? n ) 



b„e 



= ^(E^<(i-( TW )^"^)) 

j=l 

1 (l+n) 4 VarQ/) 
" & n e*» £(^) 2 1 J - 

Similarly, but using (4) in place of (5) , 

P(Y(n + l)>c n+1 e^ n+1 ^\B n ) 

< P(Y(n + 1) > c n+1 e^ n+ V | Y(n) = c n e Pn ) 

= p(jTV 3 >c n+l e^) 

3=1 



(l + n)= 
1 (1 + n) 4 Va r(y) 

We conclude that (15) holds for n large enough. □ 



5 Proof of Theorem 1.4 

The proof of Theorem 1.4 will be in two parts. 

TTie case ct < 0. It is well known (see [9, Theorem 11.1]) that the proba- 
bility that a subcritical branching process survives until time t > decays 
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exponentially fast in t. That is, there exists c > such that for every t > 0, 



P(X(t) > 0) < e - rf . 

Letting T x = inf{t : X(t) = 0} we get that E[T U ] < E[T X ] < oo. □ 

The case < a < j3. Similarly to (13) let c = (a + j3)/2 and 5 = (j3 — a)/A, 
and note that c — 5 = a + 5 and c + 5 = /3 — 5. We have that 

P(T U > t) (21) 
= P{T U > r,X(t) < e {c ~ s)t < e {c+s)t < Y(t), Vt > r/2) 
+^(T U > r, {X(t) < e {c - 5)t < e {c+S)t < Y(t), Vt > r/2} c ) 

< P{T U > r,X{t) < e {c ~ s)t < e (c+5)t < Y(t), Vt > r/2) 

+P(T U > r, 3t > r/2 : X(t) > e {a+5)t or 3t > r/2 : < Y(t) < e^- S)t ) 

< P(T U > T,X(t) < e {c - 5)t < e {c+5)t < Y(t), Vt > r/2) 

+P{3t > r/2 : X(t) > e {a+5)t or 3t > r/2 : < Y{t) < e^- &)t ). 

For the first part of the right hand side of (21), we consider (for simplicity) 
first the case r = 2n, where we get 

P{T U > r,X(t) < e (c - 5)t < e (c+5)t < Y(t), Vt > r/2) 

= P(T U > 2n,X(t) < e (c - 5)t < e (c+s)t < Y(t), Vt > n) 

< P(0 < X(t) < e ic ~^ < e ic+s ^ < Y(t), Vt G [n,n + 1]) = P(A n ), 

where A n is as in Lemma 3.2. According to that lemma, there exists a c<i > 
such that for any n, we have that P(A n ) < e~ 2c ' 2Tl = e~ C2T . It is easy to see 
that the same holds for all r (adjusting ci if necessary). 

For the second part of the right hand side of (21), we use Lemma 3.3, to 
conclude that there exists a c\ — ci(5) > such that for any r, 

P{3t > r/2 : X(t) > e {a+5)t U 3t > r/2 : < Y{t) < e^- 5)t ) 

< P(3t > r/2 : X(t) > e at+Sr ' 2 ) + P(3t > r/2 : < Y(t) < e^ 1 ) 

< e~ 5T/2 + e" ClT . 

We conclude that there exists C3 > such that P(T U > t) < e~ C3 * for any 
t > 0, and so E[T U ] < 00. □ 

Remark 5.1. Clearly E[T U ] = 00 when a > (3 > 0, since then T u takes 
value 00 with positive probability. We have not been able to determine in 
general whether or not E[T n ] is finite in the remaining case a — (3, but in the 
following special case it is easily seen to be finite. Suppose a = (3 = 0, 70 = 1 
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and A = 0. Then X and Y form independent critical branching processes. 
The extinction times T x and T Y for these respective processes satisfy 

P(T X > t) ~ 1 P(T y > i) ~ 1; 

see [2, p. 159]. Thus T u = min{T x ,T y } satisfies 

P(T U > i) = P(T X > t)P(T Y > t) ~ i 

so 

/oo poo POO ^ 

p(t u > < i + y p(t u > *)tft ~ i + y — < cx). 



6 Applications of the main results 

In this section we will briefly discuss some applications of our main theorems. 
Using our results on coexistence we are able to comment more on the issue 
of extinction of Y, which was the main focus of [3]. 

Central to the analysis in the present article were the auxiliary processes 
X and Y. Recall that Y was in essence the process Y in an 'infinite sea of 
food', i.e. X(0) = oo. However, if instead X(t) = 0, then (Y(t + s)) s >o has no 
healthy cells to feed on, and therefore (Y(t + s)) s > grows at the exponential 
rate (see also (3)) 

(3' = p-l-X = a-\. 

The qualitative behavior of (X(t), Y(t)) t > depends on the values of a, (5 and 
/3'. We discuss the possible different regimes. 

Regime 1. If a < then (X(t)) t >o eventually dies out, and since j3' < a, 
so does (Y(t)) t >o- Hence rj — 1. 

Regime 2. If < a < f3 then if 7 > it might be the case that (Y(t)) t >o 
dies out spontaneoulsy. However, if it does not, then according to Theo- 
rem 1.2, instead (X(t)) t >o will go extinct. If /3' < 0, we then conclude that 
also (Y(t)) t >o dies out, that is rj — 1. However, if j3' > then (Y(t)) t >o can 
survive on its own, that is rj < 1. 

Regime 3. If < (3 < a we are in the coexistence regime, in particular 
rj < 1. As stated in Theorem 1.2, it might be the case that X(t)Y(t) > 
for all t > 0. However, as in Regime 2, if 7 > 0, it is possible that (Y(t)) t >o 
dies out. Furthermore, if (X(t)) t > dies out, then the behavior of (Y(t)) t > 
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would again be governed by the sign of ft'. 

Regime 4. If ft < then (Y(t)) t >o eventually dies out, that is rj — 1. 

We can draw qualitative conclusions from the above description, using 
also (3). For instance, if we fix a > and E(T) > 1 it follows that a < ft 
for every A > 0, and so we are always in Regime 2. As long as A is small 
enough, so that ft' > 0, the process (Y(t)) t >o can survive. This supports the 
intuition that small A is good for the long term survival of (Y(t)) t >o, see [3]. 

If instead a > while E(T) < 1 we see that we are in Regime 2 for small 
values of A and in Regime 3 for large values of A. Depending on the exact 
values of a and E(T) we have the following possibilities: 

• for small A we have < a < ft, and ft' > so that (Y(t)) t > might 
survive, that is rj < 1; 

• for slightly larger A we can have < a < (3, and ft' < so that (Y(t)) t >o 
dies out, that is rj = 1; 

• for larger A we have < ft < a, so that (Y(t)) t >o might again survive, 
that is i] < 1; 

• for even larger A we have ft < so that (Y(t)) t >o, again dies out, that 
is 7] — 1. 

In [3], monotonicity of rj as a function of A was established when 7 = 0. 
In contrast, we see here that monotonicity of i] in A may fail if E(T) < 1 
(and it is easy to find specific parameters for this to be the case). Note also 
the difference between the first case, in which (Y(t)) t >o is strong enough to 
survive on its own, and case three where (Y(t)) t >o needs the process (X(t)) t >o 
to feed on. 
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